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OPTIMAL DETECTION OF A CHANGE-SET IN 
A SPATIAL POISSON PROCESS 

By B. Gail Ivanoff 1 and Ely Merzbach 

University of Ottawa and Bar-Ilan University 

We generalize the classic change-point problem to a "change- 
set" framework: a spatial Poisson process changes its intensity on an 
unobservable random set. Optimal detection of the set is defined by 
maximizing the expected value of a gain function. In the case that the 
unknown change-set is defined by a locally finite set of incomparable 
points, we present a sufficient condition for optimal detection of the 
set using multiparameter martingale techniques. Two examples are 
discussed. 

1. Introduction. In this paper, we consider the multiparameter version 
of the classic optimal detection problem; the goal is to detect the occurrence 
of a random set on which an observable Poisson process changes its intensity. 
To be precise, we let N = {Nt,t € R+} be a nonexplosive point process 
defined on the positive quadrant of the plane and let {r n } be its jump 
points, numbered in some arbitrary way. Then N t = YlnLi -^{t„<*} (°f- [6])- 
Here, "<" denotes the usual partial order on R/j_ : s = (s±, S2) <t = (ii, £2) ^ 
si < t\, S2 < t2- On some random set £, the intensity of -/V changes from /iq 
to fii, where < hq < specifically, given £, ./V is a Poisson process with 
intensity 

f-ol^} + Ml-fye?} = Mo + (Mi ~ MoK{te£}- 

The problem is that the "change-set" £ is unobservable and we must detect 
£ as well as possible, given our observation of the point process N. In par- 
ticular, our goal is to find a random set £ that maximizes the expected value 
of a specified valuation or gain function. The random set £ must be adapted 
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to the underlying information structure: if the information available to us 
at t € R+ is represented by the <7-field Ft , then we must have {t £ £} € Tt ■ 
There are many potential areas of application. For example: 

• Environment: The increased occurrence of polluted wells in a rural area 
could indicate a geographic region that has been subjected to industrial 
waste. 

• Population health: Unusually frequent outbreaks of a disease such as 
leukemia near a nuclear power plant could signal a region of possible 
air or ground contamination. 

• Astronomy: A cluster of black holes could be the result of an unobservable 
phenomenon affecting a region in space. 

• Quality control: An increased rate of breakdowns in a certain type of 
equipment might follow the failure of one or more components. 

• Archaeology: An increased number of archaeological items such as ancient 
coins found in a particular region could indicate the location of an event 
of historical interest. 

• Forestry: The spread of an airborne disease through a forest would occur 
at a higher rate on £, the set of points to the northeast of the (unobserved) 
point (a) of initial infection if the prevailing winds are from the southwest. 

It is this final type of example, illustrated in Figure 1, that motivates the 
model to be studied in this paper. 




= observed points of N, q = a, unobserved jump point of L 
Fig. 1. A change-set £ generated by a single point a. 
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As will be discussed in the conclusion, this paper represents only a first 
step in the solution of what we call the "optimal set-detection problem." 
Here, we consider the case in which the change-set £ is a random upper 
layer (cf. Section 2) generated by a locally finite set of incomparable points. 
In general, the optimal solution £ will be a random upper layer which is 
adapted to the available information structure. This means that the solution 
is exact in the sense that it is explicitly defined by the observed data points. 
This problem cannot be solved by one-parameter methods. Indeed, even if 
the random set is characterized by a single change-point, it will be seen that 
the optimal solution does not necessarily correspond to a point. 

In the one-parameter case, the optimal detection of an exponential change 
time in a Poisson process was thoroughly studied in [5] using martingale 
techniques combined with Bayesian arguments (see also [13] for a different 
approach to the same problem). In the general set-indexed framework, we 
found only a very few papers addressing the problem of a change-point or a 
change-set (cf. [4], [10] and [11]). However, none of these papers deal with the 
question of the existence of an optimal solution to the detection problem. 
Our approach, inspired by that of [5], makes use of the general theory of 
set-indexed martingales as developed in [6]. We are then able to solve the 
problem with a Bayes-type formula. 

The paper is structured as follows. In the next section, the model is pre- 
sented and the optimal detection problem is formally defined. In Section 3, 
we give the necessary background for the multiparameter martingale ap- 
proach that is the key for proving the existence of an optimal solution, and 
develop a semimartingale representation of the gain function. In Section 4, 
sufficient conditions for the existence of an optimal solution are developed, 
and then applied to two examples in Section 5. Finally, in Section 6, we 
discuss possible extensions and directions for further research. 

2. The model. In order to better understand the two-dimensional model, 
we review the change-point problem on R + considered in [5]. We have a 
nonexplosive point process N = {Nt,t € R+} on R + , and a random time a > 
0. Given a, N is a Poisson process with intensity fiQ on [0, a) and intensity 
fii on £ = [a, oo) (hi > fJ,o > 0). Modifying the notation of [5] slightly, the 
gain function at t is defined by 

(1) Z t = co(t A a) - ci(t - a) + + k + kil{ t > a y, 

where cq > 0, c\ > and k\ > 0. The parameters can be interpreted as fol- 
lows: the gain function is piecewise linear, increasing at rate cq before the 
jump point and decreasing at rate c\ after. When h\ > 0, a penalty equivalent 
to —k\ is incurred for stopping the process before the change has occurred. 
The gain is maximized when t = a. 
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Let T = (J~t,t 6 R+) denote the nitration which characterizes the under- 
lying information available (in [5], the process ./V is always J-~-adapted). For 
various nitrations, it is shown in [5] that Z± has a smooth semimartingale 
(SSM) representation with respect to T: 

(2) Z t = Z + I U s ds + M t , 



Jo 

where M is an ^-martingale and U is ^-progressive (i.e., observable). If U 
is monotone in the sense that Ut < =/- Ut+h < OV/i > 0, then it is straight- 
forward to see that (cf. [5], Theorem 1) a := ini{t:Ut < 0} is an optimal 
^-stopping rule for Z in terms of expected values: we have 

(3) E[Z$] = sup{E[Z T ] : r an ^-stopping time}. 

To motivate the model on R^_, we will rewrite (1) in terms of the single 
jump point process L t = I{a<t} an d the random set £ = [a, oo) = {t : L t > 0}: 

Z t = co\A t n £ c | - ci \A t n f | + k + feiLt 

(4) 

= h+ {-ci + (c + ci)X u )du + kiL t , 
J A t 

where At = [0, t] , | • | denotes Lebesgue measure and X t = 1 — I{te£} — ^{ L t =o} ■ 
We are now ready to describe the two-dimensional model. We are given a 
random Borel set £ C (0, oo) 2 . N is a nonexplosive point process on R 2 such 
that given £, N is Poisson with intensity /xo ° n £ c an d /ii on £. It is always 
assumed that fii> fio> 0. (The case /io = will be briefly discussed at the 
end of Section 4.) We will assume that the set £ is generated by a single line 
point process L: that is, L is a nonexplosive point process whose jump points 
are all incomparable (s,t £ R 2 ^ are incomparable if both s £ t and t^s). It 
is noted in [7] that in two or more dimensions, the single line process is the 
natural generalization of the single jump process, and in analogy with the 
change-point model on R + , we define £ := {t : L t > 0}. We observe that £ is 
an upper layer (£ is an upper layer if t € £ =>■ s € £ Vs > t). When L has only 
one jump point a, we observe that £ consists of the points to the northeast 
of a. This is illustrated in Figure 1. The more general situation in which L 
is a single line process is illustrated in Figure 2. In this case, £ consists of 
all the points to the northeast of one or more jump points of L. 

Using notation similar to that used for the one-dimensional problem, for 
t £ R+ let A t = {s € R+ : s < t} and X t = l- I{te£} = I{L t =o} ■ Tne definition 
of the gain function at t £ R 2 ^ is exactly the same is in (4): 

z t = c \A t n £ c | - ci \A t n £| + k + hL t 

(5) 

= ko+ (— ci + (c + ci)X u )du + kiL t . 
J At 
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Once again, we assume that en > 0, c\ > and ki > 0, and that | • | denotes 
Lebesgue measure on Rl. 



©- 



= observed points of N, q = unobserved points of L 
Fig. 2. A change-set £ generated by a single line process L. 




= observed points of N, q = unobserved points of L 
Fig. 3. A lower layer B and the change-set £. 
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Any point process iV can be indexed by the Borel sets in R+. As in the 
Introduction, if {r n } denotes the jump points of N numbered in some arbi- 
trary way, then for any Borel set B, N(B) := Y^=i^i T n £ B). [Therefore, 
we have Nt = N(At).] Consequently, we can define the gain function more 
generally over the class of lower layers C: a set B C is a lower layer if 
t e B A t C B Mt G R 2 + . The gain function at B G C is defined as 

Z(B) = c \Bn£ c \-ci\Bn£\ + k + k 1 L(B) 

(6) 

= h+ (-c 1 + {c + c 1 )X u )du + k 1 L(B). 
Jb 

A lower layer B and the change-set £ are illustrated in Figure 3; we observe 
that L(B) = 2 in this case. 

We see that the gain function defined in (6) is a natural generalization 
of the one-dimensional gain function (1). The gain evaluated at B increases 
in proportion to the area of B outside of the change-set £, and decreases in 
proportion to the area inside of £. When k\ > 0, there is a penalty incurred 
that is equivalent to —k\ times the number of points in L that lie outside of 
(or "after") B. The gain is maximized when B = £ c . 

We would like to find a random lower layer that maximizes the expected 
value of the gain function. The lower layer will depend on the available 
information, or more precisely, the underlying filtration. 

A class of cr-fields JF = {J~t,t € R/j.} is a filtration if: 

• T is increasing: s <t=? T s QJ-t, and 

• T is outer-continuous: Ft = (~) n Ft n for every decreasing sequence (i n ) C 
R^ with t n it. 

Definition 2.1 (Cf. [6]). A closed random lower layer p is an J-"-stopping 
set if 

{tepjeFt VteR^_. 

The general optimal set-detection problem in two dimensions can now be 
stated as follows: for a given filtration J 7 , our goal is to maximize E[Z p ], 
where p is an ^-stopping set. If it can be shown that a stopping set p exists 
that satisfies the condition 

(7) E\Z(p)\ = sup{E[Z(p)] : p an ^-stopping set}, 

then our optimal estimate of £ is £ = p c [(•) denotes set closure]. It is trivial 
that £ is an upper layer, and by outer continuity of J 7 , it is easily seen that 
£ is also an adapted random set (i.e., {t G £} G J-jVt G R+). 

In this paper, we will be focussing on the sequential estimation problem: 
that is, we will be assuming that Tt = = o~{N s : s < t}. If p is an T N - 
stopping set, then I(t G p) is a function of the number and locations of jump 
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points of N in the set At. For technical reasons, we shall see that in general 
it is necessary to restrict the detection problem to a bounded rectangle 
R = [0, r] 2 . The goal is to find a stopping set pQ R that is optimal in the 
following sense: 

Definition 2.2. An J^-stopping set p is called an optimal solution to 
the sequential detection problem on R provided that p satisfies the following 
equation: 

(8) E[Z{p)} = sup{E[Z(p)} : p C R an /"^-stopping set}. 

Restricting our attention to R ensures that p is bounded and so E[Z(p)] 
is always well de fined . In this case, we have an optimal estimate £r of £ fl R, 
defined by £r = R\ p. 

3. Mathematical preliminaries. In this section we present the mathe- 
matical tools needed in the sequel. In [5], Herberts and Jensen make use 
of martingale techniques to provide a simple and elegant method of finding 
sufficient conditions for the existence of an optimal solution to the detection 
problem on R + . Martingale methods have been extended to more general 
spaces in [6], and we are able to exploit this theory in a similar way. To 
motivate the necessary technical details that follow, we first describe our 
overall plan of attack. Recall that J- N denotes the filtration representing 
the data that can be observed, and below Q will denote a larger filtration 
containing additional information, some of which cannot be observed. 

Plan of attack: 

• The gain function Z can be rewritten as a (two-parameter) semimartingale 
(Definition 3.7): 

Z B = k + / U t dt + kiM B , 

JB 

where M is a weak martingale (Definition 3.1) with respect to a filtration 
Q and U is ^-adapted but not necessarily observable (cf. Lemma 3.14). 

• For the observable filtration T N and p an J-^-stopping set, we have 
E[M p ] = (Lemma 3.6) and if V t = Ept^} (observable), then 
(Lemma 3.10) 



E[Z p ] = k + E 


I U t dt 


= k + E 


I V t dt 




Jp 




Jp 



• If V satisfies a monotonicity property on R (cf. Definition 3.8 and 
Lemma 3.10), then there exists an optimal solution p to the sequential 
detection problem on R, and the optimal estimate of £ H R is 

ZR = {teR:V t <0}. 



8 



B. G. IVANOFF AND E. MERZBACH 



Keeping this outline of our approach in mind, we continue with the nec- 
essary mathematical details. 

3.1. Martingale preliminaries. Martingales on can be defined in var- 
ious ways (cf. [6]), but here we need only the weakest definition. In what 
follows, T denotes either R/j_ or a bounded region R = [0,r] 2 , and (fi, J-, P) 
is a complete probability space equipped with a T-indexed filtration T = 
{J~t '■ t G T} (without loss of generality, assume that Tt contains all the P- 
null sets Vt G T). A T-indexed process X = {X t : t G T} is adapted to T if X t 
is Tj-measurable, for all t G T. For any T-indexed process X = {Xt :t G T}, 
for s = (si, S2) < (*i , ^2) = t G T, define the increment of X over the rectangle 
(s,i\ = x (52,^2] in the usual way: 

X(s,t] =X( tut2 ) -X( ai)f2 ) -X (tliS2 ) +X (si:S2 ). 



Definition 3.1. Let M = {M t :t G T} be an integrable process on T, 
adapted to a filtration J-"= {Ft - t G T}. M is a weak J 7 -(sub)martingale if 
M is equal to on the axes, and for every s < t G T, 

£[M( S ,i]|j;] = (>)0. 

(A process A" is integrable if -E[|Xt|] < 00 Vt G T.) 

Definition 3.2. Let v = {v t : t G T} be a function on T. We say that v 
is increasing (decreasing) if: 

• v is on the axes, 

• v is outer continuous with inner limits: that is, v is continuous from above 
and with limits from the other three quadrants at each t£T, and 

• for every s < t G T, v(s,t] > (<)0. 

A process V = {Vt :t G T} is increasing (decreasing) if for each weSl, the 
function V.(uj) is increasing (decreasing). 

Comment 3.3. An increasing function v can be regarded as the distri- 
bution of a measure on R^_. Therefore, v(B) is well defined for any Borel 
set B, where we use v. and v(-) to denote, respectively, the function and 
the generated measure. Likewise, a decreasing function generates a negative 
measure, and we will use similar notation. 

Definition 3.4. Let L be a weak J^-submartingale. An increasing pro- 
cess A is a compensator for L if A is ^-adapted and M = L — A is a weak 
martingale. 
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Comment 3.5. As defined above, the compensator of a submartingale 
need not be unique (any increasing process is trivially a compensator for 
itself). A type of predictability is required for uniqueness (cf. [6]), but this 
point is not of importance here. 

In light of Comment 3.3, the following lemma is a special case of of 
Lemma 3.3.5 of [6]. 

Lemma 3.6. If M is a weak martingale which can be expressed as the 
difference of two increasing integrable processes, and p is a stopping set such 
that pCR=[0,r] 2 , then M (p) is well defined and E[M(p)] = 0. 

Definition 3.7. Let Z = {Z t :t G T} be a process on T, adapted to a 
filtration T = {J~t ■ t € T} . Z is a smooth semimartingale with respect to T 
(.F-SSM) if it satisfies a decomposition of the form 



for each t = (ii,i2) £ T, where U is an outer continuous process with inner 
limits adapted to T and M is a weak J^-martingale. We denote the .F-SSM 
as Z = (U,M). 

In order to show that an optimal solution exists to the sequential detection 
problem, we will require a monotonicity property. 

Definition 3.8. A function v = {vt - t € T} is monotone on T if v s < 
=>■ vt < Vf > s £ T. A process V is monotone if V.{uS) is monotone for each 
co € 0. 

Comment 3.9. 

1. Note that any decreasing function is monotone, but the converse is not 
true. 

2. If a process V is decreasing in each parameter separately on T, then V 
is monotone on T but not necessarily decreasing in the sense of Defini- 
tion 3.2. 

3. Note that if V is monotone, then V t > =>• V s > Vs < t. 

4. If V is monotone and adapted to a filtration J 7 , the set 



is an ^-stopping set (cf. Definition 2.1). [s = (si,S2) *C (£1,^2) = t <^ < 
ti if ti > 0, and Sj = if ti = 0, i = 1,2.] Clearly, p is a random closed 



(9) 




(10) 



p={teT:V s >0Vs«t} 
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lower layer, and the fact that V is adapted ensures that {t G p} G Tt'- 
taking any sequence (t n ) \ t with t n <^t, by monotonicity it follows that 



{t£p} = f}{V tn >0}e{jT t , 



In [5], the solution to the optimal stopping problem is based on a SSM 
representation of the form (2), which in turn is based on a projection theo- 
rem. The question of the existence of optional and predictable projections in 
higher dimensions is a delicate one, usually requiring a strong assumption of 
conditional independence on the underlying filtration [denoted (F4) in the 
two-dimensional literature]. For details, see [12], for example. In practice, 
one can generally show directly that a suitable projection exists without 
relying on a general existence theorem, and for our purposes the following 
lemma will be adequate. 



Lemma 3.10. Let U be a bounded T -indexed process adapted to a filtra- 
tion Q such that U is outer- continuous with inner limits. If J- is a subfil- 
tration of Q (i.e., Tt C Q t \/t), and if a version of Vt = E[Ut\J-t] exists that 
is outer- continuous with inner limits, then for any J- -stopping set p C R = 
[0,r] 2 , 



(11) 



E 



U t dt 



E 



V t dt 



In addition, if V is monotone on R, then the T '-stopping set p C R defined 
by 

(12) p = {t eR:V s >0Vs<i} 

is optimal in the sense that 



E 



U t dt 



sup|-E 



U t dt 



■ pQR,P an T -stopping set 



Proof. First, the assumption that U and V have sample paths that are 
regular (outer-continuous with inner limits) and that U (and hence V) is 
bounded ensures that the integrals and expectations in (11) are well defined. 

Next, let T n := {(^r, ir) : < i, j < 2 n } denote the "dyadics" of order n 
in R. The class of rectangles C n partitions R, where C G C n if C is of the 
form C = A t \ (Usgt„ s^t^s) for some t G T n . Let tc- = inf{i G C} denote 
the lower left corner of C . We now define the "discrete" approximation p n 
of p by 

Pn = |J C. 

C£C n :t c _£p 
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It is straightforward that p n C R is an ^-stopping set, that (p n ) is decreasing 
in n and p = f] p n . Boundedness and uniform integrability ensure that 
E[J U t dt] = lim n E[J U t dt] and E[J V t dt] = lim n E[J V t dt] . To complete 
the proof of the first statement in the theorem, observe that by boundedness 
of U, 



E 



j U t dt =E Yj htc-e P } [ Utdt 



E 



E 



E 



E 



{*c-ep} 



E 



C&Cn 



Utdt 



c 



to- 



cec n 

2 htc-ep} 
c&c n 

E htc-ep} 
c&c n 



E 



E 



E[U t \Ft]dt 



c 



tc- 



[ Vtdt 
JC 



t c - 



[ Vdt 


= E 


[ Vdt 


Ic 




-J Pn 



The third equality above follows by Fubini and the assumption that V has 
regular sample paths, and since t G C => t > tc~- [The assumption that V 
has a version with regular sample paths ensures that V is jointly T x /3(R/]_)- 
measurable, where Z3(R+) denotes the Borel sets in R.] 

Next, assume that V is monotone. To prove optimality of p, let pC_ R be 
any other stopping set in R. We have 



>0, 



E 


1 U t dt- I U t dt 


= E 


j V t dt- j V t dt 




.J p J p 




J p\p Jp\p 



since V > on p° (the interior of p) and V < on p c . □ 



3.2. Smooth semimartingale representation of the gain function. We be- 
gin this section with an analysis of the single line process Lt- L is a non- 
explosive point process whose jump points are all incomparable. Single line 
processes and their compensators were discussed in [7], to which the reader 
may refer for more detail. Heuristically, if T$ = cr(L u :u < s), then a process 
A will be an J-~ L -compensator of L if 

A((si, s 2 ), (si + dsi,s 2 + ds 2 )]) 

« I{ Lb=0 }E[L((si,s 2 ), (si +dst,s 2 + ds 2 )])\L s = 0], 

since L cannot have any jump points in ((si,s 2 ),(si + dsi,s 2 + ^2)]) if 
L s > and {L s = 0} is an atom of Tg . Define the (deterministic) increasing 
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function := E[L(s,t]\L s = 0], for t> s, and when it exists, let 



A, = lim 



ti^si,*24^2 (ti — si)(*2 — 82) 
In particular, if X s exists for every s € T and is Lebesgue measurable, then 

(13) A t = / A u / {jLn=0 } d«. 

In what follows (and as will be seen to be the case in our examples), we 
will assume that a representation of the form (13) exists for the compensator 
A of L, and we will refer to the deterministic function A as the weak hazard 
function of L. It will always be assumed that A is continuous. 

To better understand the weak hazard, we observe that if E\L\ of L is ab- 
solutely continuous with respect to Lebesgue measure with Radon-Nikodym 
derivative A, then for every u € T with P(L U = 0) > 0, A„ = X U /P(L U = 0). 
To see this, simply observe that for each tGT, 

(14) / \ u du = E[L t ] = E[A t ] = [ \ u P(L u = 0)du. 
J A t J At 

Returning to the gain function (6), let M denote the weak martingale 
L — A and recall that X u = I{L u= oy For any lower layer B CT, 



(15) 



Z{B) = k + [ (-ci + (co + c x )X u ) du + k x L{B) 
Jb 

= h+ / (-ci + {c + c 1 + k 1 X u )X u )du + k 1 M(B). 
Jb 



We note that X is outer-continuous with inner limits by definition and that A 
is assumed to be continuous, and so we now have an J^-SSM representation 
of the gain function: Z = (U, M), where Ut '■= —c\ + (cq + c\ + ki\ t )X t . 

Comment 3.11. As a simple illustration, if the point process L and the 
set £ = {t : Lt > 0} are unobservable and no other information is available 
(i.e., N is not observed and Ft = {0,0} Vi G T), then for R = [0,r] 2 , we are 
looking for a deterministic set B C R that maximizes 



E[Z{B)\ =E 

(16) 



ko+ {-c 1 + {c G + ci+ki\ u )X u )du + k 1 M{B) 
Jb 

k + (-c 1 + (c + c 1 +k 1 X u )P(L u = 0))du. 
Jb 



Letting V u = [— c\ + (cq + c\ + k\\ u )P{L u = 0)], it is easily seen that V is 
deterministic and an optimal solution for the detection problem exists if V 
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is monotone, in which case 

B = {t€R:V u >0Vu<.t} 



(17) 

The optimal estimate of £ n R is 



t E R : = 0) > ? Cl - Vu < i 

(co + ci + fciA u ) 



e R = nei?:P(L t = 0)< 



(c + Cl + hXt 

Example 3.12 (The single jump process). Suppose L t = IryeAtJi where 
Y is a T- valued random variable with distribution F and continuous density 
/. Then we have X u = y_ F . To verify that the representation (13) is satisfied 

with this definition, observe first that E\L(s ,i\\J-g~\ = I{l s =o}- Next, 



E 



I -i ^ T {Lu=o} du T s =[ u P(L u = 0\T s )du 

J(s,t] 1 - r u J J( S)t ] 1 - t u 



[s,t] 

fu 1 E u T j 

(s,t] 1 - *u 1 - *s 

F(s,t] 

Thus, the increasing process At = J A X S I(L S = 0) ds is a ^-^-compensator 
for L, verifying (13). 

It should be noted that in the literature on bivariate survival analysis, 
the definition of the hazard function is ^ where S u = P(Y > u). For this 

reason, we refer to our hazard A = as the "weak" hazard. 

Returning to Comment 3.11, when no information is available, V is de- 
creasing and (17) defines an optimal deterministic solution if / is decreasing 
in each parameter. 



Example 3.13 (First line of a Poisson process). Consider a homoge- 
neous Poisson process J on T with rate 7. If Aj denotes the set of jump 
points of J, then the first line of J is the single line point process L with 
(incomparable) jump points 

A L =min(Aj) = {r € Aj:r' % rVr' € Aj such that r' / r}. 

In this case, £ = {t : Lt > 0} = {i : Jt > 0}. As is shown in [7], the weak hazard 
of L is 7. 

Considering the situation in Comment 3.11 when no information is avail- 
able, we have V u = —c\ + (cq + c\ + k\^)e~ lulU2 , which is clearly monotone. 
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In this case, the optimal solution given in (17) becomes 
B= IteR-.e-^ 2 > 

teR:ht 2 < 



(c + ci + /C17) 
ln(c + ci + fci7) - ln(ci) 



7 



and 



j R = It 6 R : ht 2 > ln(C0 + Cl + * l7) " MC1) 

We are now ready to return to the sequential detection problem, and 
consider the case in which the process N is observed (recall that N is 
a Poisson process with rate hq on £ c and fi\ on £). We denote the full 
filtration T L ' N = {T^ N :t G T}, where jf ,JV = ^{L s ,iV s ,s < t}, and (as 
before) the subfiltrations T L = {T^ :teT} and T N = {F^ :teT} where 
T t L = a {L s :s<t} and T t N = a{N s :s<t}. Although we defined the weak 
hazard of L with respect to J- L , it is easy to see that given the full filtra- 
tion F L,N , L — A is still a weak J^'^-martingale. This follows because on 
{L s =0} = {s£ £ c }, iV is a Poisson process with rate no on A s and so N\a s 
(N restricted to A s ) adds no additional information about the behavior of 
Lt for t>s. Formally, we have 

E[L(s, t] \Fs' N \ = I{Ls=oA s) = E[L(s, t) \^}. 

Therefore, from this discussion we have the following lemma and we are 
ready to proceed with finding an optimal solution to the sequential detection 
problem. 

Lemma 3.14. Equation (15) defines an F L ' N -SSM representation of the 
gain function Z: Z = (U, M) where Ut ■= —c\ + (cq + c\ + k\\t)X t . 

4. Optimal solution to the sequential detection problem. We consider 
the J-~ L,Ar -SSM representation of the gain function (15): 

Z{B) = k + / (-c 1 + (c + ci + k 1 \ u )X u )du + k 1 M(B). 

JB 

In order to find sufficient conditions for the existence of an optimal solution 
in the sequential case, we will be appealing to Lemma 3.10, with Q = J-" 1 '^, 
T = T N and U t = -c x + (c + a + hX t )X t . In order to find V t = E\U t \T?], 
it is enough to determine 

E[X t \T t N ] = P(L t = 0\T t N ). 
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As in [5], we use a Bayesian argument. The first step is to determine the 
conditional likelihood ^vil(^) of -^U t given L and use this to find the like- 
lihood ^Tv(i) of N\A t - Next we find the conditional likelihood ^jv|L t =o(*) °f 
N\A t on the set {Lt = 0}. Finally, we have 

E[X t \T t N ]=P(L t = 0\T t N ) 

(18) 

£ N \ Lt =o(t)xP(L t = 0) 
i N (t) 

When computing the likelihood ^/v|L> i n f ac t it is equivalent to condition 
on the random upper layer £ = {u:L u > 0}. To see this, let (U,cIh) denote 
the collection of closed upper layers in T endowed with the Hausdorff metric. 
It is shown in [8] that (U,djj) is a complete separable metric space and that 
£ can be regarded as the unique jump point in a single jump process L on 
U\ in addition, L determines and is determined by L. In particular, Lt > 
O^t^^^Et^!;, where E% = {s € T : s > t}. Let jjL^ denote the measure 
induced by £ onW. 

Given L, or equivalently £, N is a Poisson process with rate fio on £ c and 
fi\ on £. Using the well-known likelihood for the Poisson process (cf. [3], 
page 22), we have 

iN\L(t)=£ N \i{t) 
(19) = e-MM^^mAtXO^^Atn^^NiAtnO 



-Mo I At | e - (mi -mo ) I A t n£ | 



xiV(A t n C ) 



By considering separately the events {L t = 0} = {t £ £} = {£t ^ £} = {-At Pi 
£ = 0} and {L t > 0} = {t G £} = {Et C £}, we use (19) obtain 

^(t) = P(L t = 0)e-^l^l/i^ 

(20) +e -^\M^ t e -fa-n)\A t nD\ I /fi } d 

J{D£U:E t CD} V^O/ 

= e-wl A 'l^* [P(-L t = 0) + e-^-^l^lQt], 

where 

/• ✓ \JV(A t nz>) 

(21) Q t = / e (w-w)|A.\i)| a ^ (jD ). 

J{DeU:E t CD} V/W 

Before continuing, we observe that since /^i > /io, Q is increasing in each 
parameter separately because each term in the integrand is increasing in 
each component for D fixed, and the range of integration is increasing since 
the set Et decreases with each component. 
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Next, if L t = 0, then N\A t is Poisson with rate no, and so 

(22) ^|L t =o = e^ o|At| ^. 

Substituting (20) and (22) in (18), we obtain 



e -^\M^ t [P{L t = Q) + e-^-^)\MQ t ] 
(23) x 

= 1+qtQt 

where qt = 6 pp~Ql ' ' ■ [If -P(-^t = 0) = 0, (23) remains formally valid since 
-EfXtlT"^] = and qt = oo.] We are now ready to state our main result: 

Theorem 4.1. Let L be a single line process with continuous weak haz- 
ard A, and define the function q by 

9t = p(L t = 0) fort = (t 1 ,t 2 )£~R 2 + . 

An optimal solution to the sequential detection problem on R= [0,r] 2 exists 
if A and q are decreasing and increasing, respectively, in each component on 
R. In this case V is monotone on R, and the optimal solution is given by 
(12): 

p = {teR:V s >0Vs<§:t}, 

where 

1 



V t = -ci + (c + ci + h\ 



1 + qtQt 



Proof. We review our results so far. We have the J- > '-SSM rep- 
resentation of the gain function Z{B) = ko + J B Utdt + k\M(B), where 
Ut = —c\ + (co + c\ + k\\t)X t . U is bounded since A is decreasing in each 
component and X is an indicator function. By the argument immediately 
preceding the theorem, we have that 

(24) V t = E[U t \T t N ] = - Cl + (c + ci + hXt) - ■ 1 _ ■ 

i + ftvt 

To see that V has a version which is outer-continuous with inner limits 
(o.c.i.l.), recall that A is assumed to be continuous and observe that q is 
o.c.i.l. by definition. Turning next to Q, we see that the integrand in (21) is 
o.c.i.l. and increasing in each component in t, as is 

^({DEU:E t CD}) = P(L t >0). 
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/ U t dt 


= k + E 


I V t dt 


J p 




Jp 



Therefore, it follows that Q, and hence V are o.c.i.l. Therefore, Lemmas 3.6 
and 3.10 imply that for any .F^-stopping set p Q R, 

(25) E[Z(p)]=k + E 

To show that an optimal solution p exists [as in (12)], it is sufficient to show 
that V is monotone (again, by Lemma 3.10). Since we have already seen 
that Q is increasing in each component on R, the assumption that A and 
q are decreasing and increasing, respectively, in each component imply that 
V is monotone on R. This completes the proof. □ 

Comment 4.2. It has been pointed out by an anonymous referee that 
the case po = relates to a so-called support estimation problem. In this 
case, the random set £ denotes the support of a Poisson process with rate p\ . 
The gain function can be defined exactly as before, and the analysis proceeds 
in very much the same way. Now we know that Nt > =>• t G £ =>• Lt > 0, and 
equation (18) becomes 

E[X t \T t N ]=P(L t = 0\T t N ) 

(26) =p(L t = 0\N t = 0)I(N t =0) 

P(L t = 0) 



P(N t = 0) 



I(N t = 0). 



Continuing with the same sort of arguments used previously, if = 0, equa- 
tion (23) becomes 

(27) E[Xt \T»] = —^I( Nt = 0), 

i + qtQt 

where qt is defined as before with po = 0, and 

(28) Q t = [ e^^dp^D). 

J {D&A:EtQD} 

It is easy now to see that the statement of Theorem 4.1 is still valid in this 
case, with V replaced by V, where 

Vt = -ci + (c + ci + hX t ) —^I(N t = 0). 

i + qtQt 

5. Examples. In this section, we apply Theorem 4.1 to our two examples. 
We will see that in some sense they are are both analogous to the univariate 
model of [5], in which the change-point is exponentially distributed. There 
are two natural generalizations in R/j_: first, L is the single jump process in 
which the components of the jump are independent univariate exponential 
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random variables, and second, L is the first line of a Poisson process, noting 
that an exponential random variable can be regarded as the "first line" of 
a Poisson process on R + . Although at first glance the single jump process 
looks more straightforward, we shall see that in fact the analysis is far more 
complex than in the case of the first line of a Poisson process. 

Example 5.1 (The single jump process). Referring to Example 3.12, 
we have Aj = 1 l^ Ft and qt = 6 ^LJ^Jl! 2 _ Here we will consider the case in 
which the components (Yi,Y 2 ) of the jump Y are independent identically 
distributed exponential random variables with parameter 7. In this case, 

ft = 7e -7*i 7e -7fe = 7 2 

* l-(l-e-T*i)(l- e -T*2) e 7ti+ e 7*2_i' 

and is decreasing in each component. Next, we consider qt'. 



g— (w~ Mo)*l*2 g-(A t l-Mo)il*2 



qt 



1 - (1 - e-'T' 1 )(l - e"T* 2 ) e"^ 1 + e"^ 2 - e -">(* 1+i2 ) ' 

To find sufficient conditions to ensure that q is increasing in t\ and t 2 on 
some set R = [0, r] 2 , we will assume that 7 > fii — fiQ and to simplify the dis- 
cussion (without loss of generality, by suitably rescaling the time parameters 
if necessary) that fi\ — fiQ = l. Now rewrite qt = l/gt where 

9t = 9(t u t 2 ) = e-^-^il - e~^) + e^ 2 e~^. 

We will show that if r < then ^g( tl ,t 2 ) < for (*i,*2) €R=[0, rf . By 
symmetry, the same is true for ^9(ti,t2) I ' or * € i2. Therefore, g is decreasing 
and q = 1/g is increasing in each component on R, and an optimal solution 
exists for the sequential detection model. 
To complete the example, we observe that 

i 9 {h ,t 2 ) = e-^" i2 )(-( 7 - t 2 )(l - e~^) + taeTfr-**)) < 



( 7 -t 2 )(l- e -7*2)> t2 g7(*i-* 2 ) 



dt x 

if and only if 
or equivalently, 

(29) e 7 * 1 < (e^ 2 



The left-hand side of (29) is bounded above by 7 since t\ < r < The 
right-hand side of (29) is bounded below by 7 since t 2 < r < < 7 — 1 when 
7 > 1, and so ^(e 7 * 2 - 1) = ^7^(7 - h) > 7(7 - t 2 ) > 7. Therefore, it is 
sufficient that t\ , t 2 < r < — . 
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Example 5.2 (First line of a Poisson process). From the discussion 
in Example 3.13, if L is the first line of a Poisson process with rate 7, 
then A = 7, and so trivially is decreasing in each component. We have qt = 
- — ^I 7t M 1 ° t ) 2 1 2 = e*- 7- ^ 1- ^ ^' 1 * 2 , which is increasing in each component if 7 > 
Hi — Ho- Therefore, an optimal solution to the sequential detection problem 
exists on any bounded set R = [0,r] 2 if 7 > Hi — Ho, and is defined by (12). 
In fact, this is exactly the same as the sufficient condition for the univariate 
detection problem proven in [5] and [9]. 

6. Conclusion. As indicated in the Introduction, the sequential detection 
model considered here is only one of many scenarios that should be analyzed 
in the general context of the "optimal set-detection problem." Indeed, the 
model can be extended in many possible ways. 

• The information structure: In addition to the sequential information model, 
Herberts and Jensen [5] consider what they call the "ex-post" analysis. 
This would correspond to observing N on all of R, and then trying to 
optimize the expectation of the valuation function. (Formally, this cor- 
responds to J~t = J-r rr \Vt S R-) Several variants or combinations of the 
ex-post and sequential schemes can be studied. 

• The underlying space: We worked here on a bounded subset of R^_. 
It would be of interest to consider change-point problems on higher- 
dimensional Euclidean spaces or more general partially ordered sets as 
in [8]. 

• The change mechanism: Here the change occurs at either a single random 
point or at the first line of a more general point process. The example 
involving the first line of a Poisson process turned out to be (perhaps 
surprisingly) the more natural analog of the one-dimensional exponen- 
tial change-point problem. Consideration should be given to more general 
single jump and first line processes, as well as to more general random 
sets (not necessarily upper layers). For example, the case in which L is 
the first line of an inhomogeneous Poisson process with intensity 7Q is 
considered in [2] where it is proven that an optimal solution exists if 
inf uefl 7(n) >Hl~ Ho- 
rn The observed process: On R + , the process subject to the change can be 

a more general process, such as the Brownian motion process (cf. [1]). 
Here too, we can consider more general processes such as the set-indexed 
Brownian motion (cf. [6]). 

• The parameters: In our analysis, it is implicitly assumed that the param- 
eters of the various processes are all known. How does one approach the 
problem when one or more parameters must be estimated? 

• The gain function: Different valuation functions can be chosen, thereby 
changing the notion of optimality. For example, with a change generated 
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by a single jump at Y, instead of two cost parameters cq and c\ associated 
respectively with E Y and Ey, we could have different costs in each of 
the four quadrants defined by Y. Another variation considered in [2] is 
to replace Lt in (5) with l{Lt > 0). Although this does not change the 
valuation when the change is generated by a single jump, the analysis 
becomes more complex when L is the first line of a Poisson process. 
• Number of changes: Here we deal with only one change-set. However, we 
can imagine that several changes occur on a decreasing sequence of ran- 
dom upper layers, for example. This would correspond to multiple change 
points on R + . 
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